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Abstract 


Set packing is a fundamental problem that generalises some well-known combi¬ 
natorial optimization problems and knows a lot of applications. It is equivalent 
to hypergraph matching and it is strongly related to the maximum independent 
set problem. 

In this thesis we study the k -set packing problem where given a universe 
U and a collection C of subsets over U, each of cardinality k, one needs to find 
the maximum collection of mutually disjoint subsets. Local search techniques 
have proved to be successful in the search for approximation algorithms, both 
for the unweighted and the weighted version of the problem where every subset 
in C is associated with a weight and the objective is to maximise the sum of 
the weights. We make a survey of these approaches and give some background 
and intuition behind them. In particular, we simplify the algebraic proof of 
the main lemma for the currently best weighted approximation algorithm of 
Berman QBerOOj l into a proof that reveals more intuition on what is really 
happening behind the math. 

The main result is a new bound of | + 1 + e on the integrality gap for a 
polynomially sized LP relaxation for fc-set packing by Chan and Lau 1 |CL10| 1 
and the natural SDP relaxation [NOTE: see page iii] . We provide detailed 
proofs of lemmas needed to prove this new bound and treat some background 
on related topics like semidefinite programming and the Lovasz Theta function. 

Finally we have an extended discussion in which we suggest some possibil¬ 
ities for future research. We discuss how the current results from the weighted 
approximation algorithms and the LP and SDP relaxations might be improved, 
the strong relation between set packing and the independent set problem and 
the difference between the weighted and the unweighted version of the problem. 
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Chapter 1 

Introduction 


1.1 Definition of the problem 

Set packing and As-set packing Set packing is one of Karp’s 21 NP-complete problems 
|Kar72j and it has received a lot of attention during the past years. A lot of progress has 
been made on the complexity of this problem, even though under standard complexity 
assumptions algorithms for this problem require at least super polynomial or perhaps even 
exponential running time. Set packing is a fundamental problem that generalises some 
well-known problems and thus knows a lot of applications. There has been a long line 
of research on this problem. In this thesis we will consider As-set packing, which is the 
following problem. 

As-Set Packing (As-SP) 

Given: a universe U oi N elements and a collection C C 2 W of n subsets over 
U , each of cardinality As 

Find: a maximum collection of mutually disjoint subsets in C. 

Any collection of mutually disjoint subsets in C is called a packing and the goal is to 
find the largest packing. In As-set packing every subset in C contains at most As elements. 
Without loss of generality assume every subset contains exactly As elements: add some 
unique dummy elements to the subsets of less than As elements. 

As-set packing is a special case of the optimization version of the set packing problem, 
where there is no restriction on the cardinality of every subset in C. 

(Maximum) Set Packing (SP) 

Given: a universe IA of N elements and a collection C C 2^ of n subsets over IA 
Find: a maximum collection of mutually disjoint subsets in C. 
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CHAPTER 1. INTRODUCTION 


1.2 Terminology 

We proceed with some notational conventions and definitions to make sure no confusion 
may arise. By hi we mean the universe of elements over which the k- set system C C 2^ 
has been defined. We write \U\ = N and \C\ = n. By I = ( U,C ) a given instance for fc-set 
packing is denoted in which the objective is to find the largest packing. 

Definition 1. (Conflict graph) Let an instance I = (U,C) for the k-set packing problem 
be given. The conflict graph of I is the graph G where every subset in C is represented by a 
vertex. Two vertices are adjacent if and only if the subsets in C these vertices correspond 
to intersect each other. 

Now let A be some solution to this instance, i.e. a collection of subsets in C that are 
mutually disjoint. 

Definition 2. ( Intersection graph) Let an instance I for the k-set packing problem be 
given and let A and B be two packings. The intersection graph of A and B is the induced 
subgraph of their vertices in the conflict graph of I. 

The bipartite intersection graph of A and B thus contains a vertex for every set in A 
and B and two vertices are adjacent if and only if the subsets in the set packing instance are 
in conflict. Throughout this thesis calligraphic letters A will be used to denote collections 
of subsets and normal letters A to denote the set of vertices corresponding to A in the 
conflict or intersection graph. Denote Ng{B,A) = N(B) n A, the neighbours of B in A 
in their intersection graph G. Sometimes for brevity we just write N(G,A) if the graph 
G is clear from the context. We can now succinctly define an improving set. 

Definition 3. (Improving set) Let A and B be two packings. B is called an improving 
set for A when \ A U B \ Ng{B, A)\ > \A\, i.e. when adding the sets of B to the current 
solution A and removing the sets of A that they intersect, leads to a solution of larger 
cardinality. 

The removal of the neighbourhood of B in A ensures that the new solution is also 
mutually disjoint. Now define the following. 

Definition 4. (t-locally optimal solution) Let A be a packing. A is said to be t-locally 
optimal when for every collection of mutually disjoint subsets B with \B\ < t we have 
|.AU B\ Nq(B,A) | < |A|. In other words, a solution is a t-locally optimal solution when 
there does not exist an improving set of size at most t. 




1.3. CONTRIBUTION 
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Finally, an algorithm is said to approximate a maximization problem within a factor 
p > 1 if for every instance of the problem < p, where v(A) is the objective value 

of the output of the algorithm and v(OPT ) is the best achievable objective value for that 
instance. We call p its approximation guarantee. 


1.3 Contribution 

1.3.1 Improved integrality gap 

In this thesis we improve the integrality gap of a linear program of fc-set packing called 
the intersecting family LP (see Section [4. 21) . 

Theorem 5. Let e > 0. The integrality gap of the intersecting family LP is at most 
| + 1 + e. 


Following the results from Chib . this immediately implies the following two theorems. 

Theorem 6. Let e > 0. There is a polynomially sized LP for k-set packing with integrality 
gap at most | + 1 + e. 

Theorem 7. Let e > 0. There is a polynomially sized SDP for k-set packing with inte¬ 
grality gap at most I I fe. 

The previous bound on this integrality gap was hid [ CLIO ]. This was a continuation of 
the work on the standard linear programming relaxation for &;-set packing. We treat these 
results on the linear programs in Chapter [H The result on the semidehnite programming 
relaxation is treated in Chapter O along with some background. 

1.3.2 Simplified proof 

We simplify the proof of the main lemma of the currently best weighted approximation 
algorithm from Berman |Ber00t Lemma 2], The current proof is very clever but also very 
algebraic. We make the observation that the squared weight function somehow captures 
both the maximum weight and the sum of the weights of the neighbourhood of a vertex in 
the conflict graph of the instance. This allows us to avoid the algebraic proof and simplify 
it. 


We treat this result in Chapter [6l 
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CHAPTER 1. INTRODUCTION 


1.4 Outline of the thesis 

Chapter 2 Set packing is one of the fundamental optimization problems and therefore 
there are numerous applications within mathematics and real life. It is highly related 
to some other well-known problems such as the hypergraph matching problem and the 
maximum independent set problem. In Chapter [2] these applications and related problems 
are considered. 

Chapter 3 In Chapter [3] the current results on the k -set packing problem are discussed. 
First the unweighted approximation algorithms and the weighted approximation algo¬ 
rithms are considered in Sections l3. II and 13.21 For the unweighted problem the best approx¬ 
imation algorithm currently achieves an approximation guarantee of ^4+e: |Cygl3[fFY13| 
and for the weighted problem the best result is a ^^-approximation |BerOO) . 

Then Section 13.31 continues with the parameterized algorithms for k -set packing, as 
this problem is fixed parameter tractable. There has been a long line of research in this 
area and Appendix [~Y1 contains an overview of these algorithms. 

Chapter 0 ends with some results on the inherent hardness of the problem in Section 
13.41 Subsection 13.4.11 starts with some hardness results that apply to the general set 
packing problem. Subsections 13.4.21 13.4.31 and 13.4.41 contain theorems on fc-set packing 
specifically. It is NP-hard to approximate within a factor of 0 [HSS06) and three 

other results on the limits of local search techniques for the problem are mentioned |Cygl3, 
lFYl3llSWl3j . 

Chapter 4 Chapter 4 treats the results on the standard linear programming relaxation 
and the intersecting family LP and contains the proofs of Theorems [5] and [6j 

Chapter 5 The proof of Theorem [7] is given in Chapter [5] along with some background 
about semidefinite programming and the Lovasz Theta function. 

Chapter 6 The topic of Chapter [6] is weighted fc-set packing. Currently the best result 
is a -—^—-—approximation from Berman [BerOO] , He provides two algorithms. One is called 
SquareImp which uses a local search technique using the squared weight function. The 
second is called WishfulThinking which searches locally for structures which he calls 
nice claws (definitions are in Chapter [6]). He links these two algorithms in a shrewd way, 
allowing him to proof both the approximation guarantee and the polynomial running time. 
At the end we give a simplified proof of the main lemma. 


























1.4. OUTLINE OF THE THESIS 
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Chapter 7 Finally there is an extended discussion in Chapter [7] about possible improve¬ 
ments. The results on the LP and the SDP relaxation are discussed in Section o and 
whether these can be extended to the weighted case. We see why changing Berman’s 
[BerOO] weighted ^-^—approximation algorithm a bit does not yield an improvement in 
Section El In Section 17.31 the strong relation between set packing and the independent 
set problem (on bounded degree graphs) is considered and it is argued why the results for 
independent set are better. We discuss why the weighted problem asks for such different 
algorithms compared to the unweighted version of the problem in Section 17.41 Several 
suggestions for future research are given. 






7 


Chapter 2 

Applications and related problems 


2.1 Applications 

Set packing has a lot of applications in capital budgeting, crew scheduling, cutting stock, 
facilities location, graphs and networks, manufacturing, personnel scheduling, districting, 
information systems, vehicle routing and timetable scheduling, see |Vem98j for a survey. 
In this section some real life applications are mentioned and in Section 12.21 the relation 
of set packing to other combinatorial problems is discussed. Section 12.31 then gives some 
background on local search as a background for Chapter [3j 

Latin squares A nice application of set packing is the extension of partial Latin squares 
[GRS044 1HJKS07] . A partial Latin square is an n x n array in which each cell is either 
empty or coloured with exactly one of the colours {1,..., n}. A Latin square is a partial 
Latin square without empty cells where every colour occurs exactly once in every row 
and every column. The problem is given a partial Latin square to find a completion that 
colours as many empty cells as possible such that no rows or columns contain any colour 
more than once. 

This can be modeled as a set packing problem as follows. Let the universe U consist 
of 3n 2 elements of the form {ej,7j}, {ei,Cfc} and {rj,Ck} for every combination of some 
colour i, some row j and/or some column k. Call an element {ei,rj} or {ei,Ck} empty if 
respectively row j or column k does not contain colour i, and call {rj,Ck} empty if the 
cell in row j column k is empty. Now let the collection of subsets C consist of all triplets 
{{ei,rj},{ei,Ck},{rj,Ck}} where all three elements are empty. This creates a set packing 
instance where every triplet contained in the solution indicates to colour the cell at row j 
column k with colour i. 
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CHAPTER 2. APPLICATIONS AND RELATED PROBLEMS 


This idea can be extended to the popular Sudoku puzzles. A Sudoku is a 9 x 9 Latin 
square with the additional constraints that in the nine 3x3 “boxes” every colour is allowed 
only once. Using quadruples rather than triplets, this problem can be translated to set 
packing in a similar fashion. 


Other applications Three real life applications one could think of is the assignment of 
crew members to airplanes [Ski08| . the generation of a coalition structure in multiagent 
systems SLA + 99] and determining the winners in combinatorial auctions to maximise the 
profit [GLOO , San021 IVic61] . 


2.2 Related problems 

Set packing is highly related to the hypergraph matching problem and the independent 
set problem. This section surveys these relations and some special cases of set packing. 

2.2.1 Hypergraph matching 

Set packing and hypergraph matching are really the same problem with different names. 
This subsection contains some background on hypergraphs and hypergraph matchings and 
relates these notions to set packing and fc-set packing. 

Hypergraphs A hypergraph H is a pair H = (V, E) where V is the set of vertices and 
E is the set of hyperedges. A hyperedge e € E is a nonempty subset of the vertices. 
In a weighted hypergraph, every hyperedge e € E is associated with a weight w(e). A 
hyperedge e E E is said to contain, cover or be incident to a vertex v € V when u£e. 

For a vertex v, S(v) denotes the set of hyperedges incident to it. The degree of a 
vertex v is |<5(u)|. The cardinality of a hyperedge is the number of vertices it contains. 
When every vertex has the same degree k, the hypergraph is called fc-regular. When every 
hyperedge has the same cardinality k, the hypergraph is /c-uniform. A graph is thus a 
2-uniform hypergraph. In a graph, every edge is a subset of two vertices. 

Matchings A hypergraph matching is a subset of the hyperedges MCE' such that every 
vertex is covered at most once, i.e. the hyperedges are mutually disjoint. This generalises 
matchings in graphs. From now on, with matching we mean a hypergraph matching. The 
cardinality of a matching is the number of hyperedges it contains. A matching is called 
maximum if it has the largest cardinality of all possible matchings. In the hypergraph 
matching problem, a hypergraph is given and one needs to find the maximum matching. 












2.2. RELATED PROBLEMS 
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Hypergraph matching problem 

Given: a hypergraph H = (V. E) 

Find: a maximum collection of mutually disjoint hyperedges. 

A- partite hypergraphs The notion of bipartiteness in graphs can be generalised in a 
hypergraph to the concept of Ac-partiteness. A hypergraph H = (V, E ) is called Ac-partite if 
the set of vertices V can be partitioned into k classes V\..... Vj, such that every hyperedge 
e € E touches exactly one vertex from every vertex class. A Ac-partite hypergraph is 
Ac-uniform by definition. The hypergraph matching problem on a A:-partite graph is also 
called the Ac-dimensional matching problem. This generalises the bipartite matching on 
graphs where k = 2 and it is a restricted version of hypergraph matching on a Ac-uniform 
hypergraph. In particular the 3-dinrensional matching problem is well studied in literature. 

Relation to set packing Let a set packing instance S = (U,C ) be given. Define a 
hypergraph E[ = ( U,C ), calling the elements in U vertices and the subsets in C hyperedges. 
Then the set packing problem on S is exactly the same as the hypergraph matching 
problem on H. An instance of the Ac-set packing problem translates to an instance of the 
hypergraph matching problem on a Ac-uniform hypergraph, as every subset (hyperedge) 
contains exactly Ac elements. 

Results for set packing thus immediately apply to hypergraph matching and vice versa. 

2.2.2 Independent set 

Relation to set packing Set packing is also closely related to the (maximum) inde¬ 
pendent set problem. Given a graph G = (V, E), an independent set is a subset of vertices 
that are mutually non-adjacent, i.e. a set of vertices whose induced subgraph does not 
contain any edge. This subsection treats this problem and its relation to set packing. 

Independent set problem 

Given: a graph G = ( V , E) 

Find: a maximum collection of mutually non-adjacent vertices. 

Let a set packing instance I = (U,C) be given. Create the conflict graph G = ( C,E ) for 
the instance I. Then the edge set E captures all the intersections between the subsets. 
Any packing of C now corresponds to an independent set in G: as a packing is mutually 
disjoint by definition, the corresponding vertices are non-adjacent. On the other hand, 
any independent set in G corresponds to a packing of C. Finding a maximum packing is 
thus equivalent to finding a maximum independent set in the conflict graph. 
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CHAPTER 2. APPLICATIONS AND RELATED PROBLEMS 


Relation to A- set packing Now consider a A-set packing instance ( U,C) where every 
subset in C contains exactly k elements from U. Consider a fixed subset S and look at the 
set N(S ) of all subsets intersecting S. There can be at most k subsets in N(S) that are 
mutually disjoint, when each subset intersects S in a distinct element. Thus within IV(S'), 
the maximum packing has at most cardinality k. 

Now consider the corresponding conflict graph G. S' is a vertex in G and N(S) is the 
set of all neighbours of S. By the previous reasoning, N ( S ) contains an independent set of 
size at most k. Thus in the neighbourhood of any vertex of G there are at most k mutually 
non-adjacent vertices. In other words, the conflict graph is A + 1-claw free: it does not 
contain IL^fc+i as an induced subgraph, where K n , m is the complete bipartite graph on n 
and m vertices. A vertex can have any number of neighbours, but the maximum number 
of mutually non-adjacent neighbours is k. 

Independent set on bounded degree graphs On the other hand, the maximum 
independent set problem in graphs G where all degrees are bounded by A: is a special case 
of A-set packing. Map every vertex in G to a subset in C and map every edge to a distinct 
element in U. Let a subset in C (a vertex in G) contain all elements in U (edges in G) 
that are incident to it. Now every subset in C contains at most k elements because every 
vertex in G had degree at most k. Again, by adding dummy elements to the subsets with 
less than k elements, every subset in C has exactly k elements. As such, the A-set packing 
problem generalises the maximum independent set problem on bounded degree graphs. 

Section [7731 discusses the relation between the results on set packing and independent 
set more thoroughly. 

2.2.3 Special cases of set packing 

Clique and triangle packing There are some special cases of set packing we would like 
to mention. Instead of studying the set packing problem with a bound on the cardinality 
of every subset, it is also possible to study the problem with certain given structures on 
the given sets. One example is the clique packing problem, where one is given some graph 
G. A packing of G is a collection of pairwise vertex-disjoint subgraphs of G, each of which 
is isomorphic to a clique. A packing is said to cover an edge of G if one of the subgraphs 
contains that edge. In the clique packing problem, one tries to maximise the number 
of covered edges [CMWY09] , The special case where every clique is of size 3 is called 
the triangle packing problem [CTW09i 1HR06 ], The set packing problem generalises these 
problems. 








2.3. LOCAL SEARCH 
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Tree-like weighted set packing The tree-like weighted set packing problem is another 
subproblem with an additional structure on the sets that allows for better results that even 
extend to the weighted version. In this problem, the subsets in C can be organised into a 
forest F satisfying the following properties. Every vertex in F corresponds to one subset 
in C. If a vertex Y is a child of a vertex X, then Y is a subset of A. If Y and Z are distinct 
children from X, then Y and Z are disjoint. And the roots of all the trees are pairwise 
disjoint. When the sets are structured like this, the problem can be solved exactly in cubic 
time using a dynamic programming algorithm [GTlOj . 

k- diincn s iona 1 matching Perhaps the best known special case of /,:set packing is the 
/[-dimensional matching problem. As was mentioned in Subsection 12.2.11 this is the hy¬ 
pergraph matching problem on a ^-partite hypergraph, which is a stronger structure than 
a fc-uniform hypergraph. In particular 3-dimensional matching is well-known. All results 
for k -set packing immediately apply to /[-dimensional matching. 

2.3 Local search 

Background A lot of the approximation algorithms for k -set packing are related to the 
notion of local search techniques. In this section we survey this related topic. Local search 
is a natural heuristic to tackle difficult problems. The most basic solution approaches 
for a discrete optimization problem are generating just one solution (e.g. using a priority- 
based heuristic), enumerating implicitly all possible solutions (e.g. with branch and bound 
techniques) and generating several solutions and choosing the best. Local search is a 
heuristic that finds from an initial solution a sequence of solutions that use the previous 
solution for the next solution, with the goal to increase the objective value from the initial 
solution to a better value. A neighbourhood structure is defined on the solutions that 
somehow resembles similar solutions. Until some stopping criterion is reached, in every 
iteration a solution from the neighbourhood of the current solution is chosen. 

A local search heuristic usually consists of four main elements: a method to calculate 
an initial solution, a definition of the neighbourhood of a solution, a criterion to choose 
a solution from any neighbourhood, and a stopping criterion. The method to calculate 
an initial solution is needed to start a local search, but is usually not seen as a part of 
the local search technique itself. For example, Chapter 0 contains a lot of results stating 
that a locally optimal solution with respect to some neighbourhood structure achieves a 
certain approximation guarantee, without going into the details of how to find an initial 
solution or sometimes even how to iterate to the next solution. 
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CHAPTER 2. APPLICATIONS AND RELATED PROBLEMS 


Results Local search techniques are widely used in practice, the most common example 
being for the traveling salesman problem. However, there are not much positive results 
on local search in theory. In fact, a complexity class called VCS (polynomial time local 
search) has been defined [JPY88] . Problems that are R/LS-complete are as hard as the 
hardest local optimization problems. A lot of well-known heuristics are known to be VCS- 
complete. There are even some problems for which it has been proved that there is no 
sequence, of less than exponential length, of improvements ending in a locally optimal 
solution |Hal95j . To make matters worse, there are popular heuristics with good average- 
case behaviour that perform quite bad at relatively easy problems. For instance, the 
algorithm at the heart of the well-known simulated annealing performs quite poor at 
finding a maximum matching (which is even in V). 

On the positive side, set packing does not fit this negative theoretic framework. Cur¬ 
rently all approximation algorithms for both the weighted and the unweighted fc-set pack¬ 
ing problem use local search at their core. However, there are also some results on the 
limits of the approximation guarantees using local search. The next chapter proceeds with 
approximation algorithms for k -set packing and the bounds on what is achievable using 
this technique. 
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Chapter 3 

Current results 


3.1 Unweighted approximations 


In this section we briefly mention the state of the art of unweighted approximations for the 
fc-set packing problem. There has been a long line of work in this area and Subsection l3.1.1l 
starts with the first | + e-approximation given already in 1989 by Hurkens and Schrijver 
[HS89] . We give a proof and some intuition for the case of a 2-locally optimal solution. 


Subsection l3.1.2l continues with a -approximation [Hal95j and a ^jp+e-approximation 


|CGM13] . These search for improving sets of size 0(log n), so the improved approximation 
guarantee is at the cost of the running time, which is n°( logri ) for both algorithms (also 
called quasi-polynomial running time). 

Only this year these algorithms were adapted to run in polynomial time [Cvgl3 1SW13] . 
These results are mentioned in Subsection 13.1.31 together with an adaptation of the pp- 
approximation to turn it into another pp-approximation jFY13| . These are currently the 
best results. 

Next, Section 13.21 treats the current state of the art of the weighted approximation 
algorithms, followed by the parameterized algorithms and the hardness results in Sections 
1331 and l3~4l The results on the linear programming and semidefinite programming relax¬ 
ations are postponed to the next chapters, as these will be treated in more detail and we 
will provide a new bound for their integrality gap. 


3.1.1 The first approximation algorithm 

The unweighted fc-set packing problem can be solved in polynomial time for k = 2 )Min80] , 
so from now on assume k > 3. 
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CHAPTER 3. CURRENT RESULTS 


2-locally optimal solution Perhaps the easiest local search technique to try is to 
iteratively search for an improving set of size 2. This is either one subset in C that does 
not intersect the current solution A or a pair of subsets that intersect at most one subset 
of A. Starting with the empty solution and iteratively adding improving sets of size 2 
yields an algorithm that is ^^--approximate |BNR9ol lHa!95| . 


First approximation algorithm A natural extension of this search technique is to 
search for improving sets of larger cardinality. If instead of improving sets of size 2 
improving sets of constant size s are searched for increasing values of s, it is possible to 
obtain a polynomial time approximation ratio of f+£. This was discovered by Hurkens and 
Schrijver [HS89] and this was actually the first approximation algorithm for k -set packing. 
Even though the problem was well-studied in the years that followed, this remained the 
best polynomial time approximation for over 20 years. 

To be precise, they proved the following. 

Theorem 8. ( IHS89 1 Theorem 1]) Let E \,..., E m he subsets of the set V of size n, such 
that: 


1. Each element of V is contained in at most k of the sets E\,..., E m ; 


2. Any collection of at most t sets among E \, 


. E„ 


sentative 


fl 


has a system of distinct repre- 


Then, we have the following: 


m ^ k(k — 1 ) r — k 
n ~ 2 {k — l) r — k 
m ^ k(k - l) r - 2 
~n ~ 2 (k - l) r - 2 


if t = 2r — 1; 


if t = 2 r. 


Proof for t = 2 They provide a very keen yet complicated proof. We will give the proof 
and some intuition for the case t = 2, which establishes that any 2-locally optimal solution 
is at most a factor of away from the optimal solution. Let A be any 2-locally optimal 
solution and B be any optimal solution. Denote by B\ and £>2 the sets in B that intersect 
A in one set respectively at least two sets. As A is 2-locally optimal there are no sets in 
B that do not intersect any set of A. Note that every set in A intersects with at most k 
sets in B. So 

|£>i| + 2|£? 2 | < k\A\. 

X A system of distinct representatives of sets {-E;} is a set {e;} such that d £ E; for all i and d A e :i 
for i j. 
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Since A is 2-locally optimal, \B\\ < \A\ and hence 

2\B\ = \Bi\ + 2\B 2 \ + \B\\ < k\A\ + \A\ = {k + l)\A\, 
and therefore jpj- < Np-. 

Intuition Intuitively the argument boils down to the following. Look at the intersection 
graph of the current 2-locally optimal solution A and some optimal packing B. The sets 
in B that conflict with just one set in A are not particularly interesting as they do not 
form an improving set. Let us delete these sets from the intersection graph together with 
the sets in A they intersect. Because A is a 2-locally optimal solution, the sets in B 
conflicting with two sets in A still have degree at least 1, while the degrees of the other 
sets can be anything. That is fine, because in general when every degree is at most two 
we are done: the number of edges between A and B is then at most 2m and this number 
is upper bounded by nk and hence 2m < nk. So the core of the argument really consists 
of small conflicts posing no problem and concentrating on the remainder of the graph. 
The iterative process of deleting these sets cannot go on forever, because there are only 
so many vertices in A and possibly much more in B. 


3.1.2 Quasi-polynomial time algorithms 

A ^-^-approximation Halldorsson [Hal95] was the first to find ^^-approximate so¬ 
lutions. His clever insight is that using a search space of improving sets of size O(logn) 
allows to improve the approximation guarantee from about | to about 4- However, in 
a straightforward implementation of an iterative algorithm with such a search space, the 
running time would be quasi-polynomial, i.e. n°( logn l The deduction of this running time 
is straightforward: one can find an improving set of size O(logn) in time n°^ ogn \ and the 
number of iterations is trivially upper bounded by n. 


A pp + e-approximation This year, Cygan, Grandoni and Mastrolilli |CGM13] im¬ 
proved upon this result and showed that the approximation guarantee of an O(logn)- 
locally optimal solution can in fact be bounded by pp + e. The running time is still 
quasi-polynomial. Their analysis is more involved and they use a lemma from |BF94| as 
a black box. In Chapter [4] the same lemma is used to prove an improved bound on the 
integrality gap of an LP formulation for the &:-set packing problem. 
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3.1.3 Polynomial time algorithms 

A ^^-approximation Sviridenko and Ward |SW13| recently established an elegant 
polynomial time approximation algorithm that beats the approximation guarantee of § 
and this is the first polynomial time improvement over the | + e result from '11S89I . Their 
approximation guarantee is 

Their key insight is that using improving sets with a special structure in the analysis of 
Halldorsson |Hal95| allows to reduce the running time. They look at a graph with a vertex 
for every set in the current solution A. Two vertices S, T are connected if there is a set in 
C \A that intersects A only in S and T. Possibly S = T, which gives rise to a loop; so every 
set that intersects with at most two sets in A corresponds to one edge. Then they define 
three canonical improvements, which are three types of connected graphs containing two 
distinct cycles, thus having more edges than vertices. They iteratively search for canonical 
improvements, and when one is found they remove the vertices from A and add the edges 
from C \ A to the current solution. 

Another key insight is then to combine this with the famous color-coding technique 
|AYZ95| and an application of dynamic programming, which allows them to efficiently 
search for these canonical improvements. The result is a new polynomial time approxima¬ 
tion after more than 20 years. 

A _|_ g-approximation Only two months later the polynomial time approximation 
guarantee was cleverly improved to + e by Cygan |Cygl3| . He adapted the quasi¬ 
polynomial -approximation from |CGM13] , also with the sharp insight that structured 
sets reduce the search space and the running time without losing the good approximation 
guarantee. His algorithm searches for elegant structures that he calls improving sets of 
bounded pathwidth. Again using the color-coding technique, the algorithm can search the 
space of O(logra) improving sets of bounded pathwidth in polynomial time. 

Another + e-approximation Independently, another + e polynomial time 
approximation algorithm was found four months later by Ftirer and Yu | FY 13 1. This is 
an ingenious improvement of the polynomial time ^^-approximation of [SW13j . Let A 
denote the current solution again. Define a bipartite auxiliary graph G _4 with a vertex for 
every set in A in one colour class and a vertex for every set in C \ A in the other colour 
class. Connect two vertices from both colour classes if and only if the sets corresponding 
to them intersect. Indeed, this is the conflict graph of the instance with only a subset of 
the edges. 



















3.2. WEIGHTED APPROXIMATIONS 


17 


The algorithm starts by looking for improving sets of constant size as in [HS89] . Then 
they guess O(logn) sets X in C \ A that might form an improving set. Now consider any 
collection of sets Z 3 within X of degree at least 3 in the graph G 4 . For such a collection, the 
algorithm looks for a sequence of replacements that swaps t sets in A with t sets outside 
A. The idea is to increase the number of sets in C \ A that intersect A in at most two sets. 
If the degree of the vertices in Z 3 now drops to 2 or less, they check whether these vertices 
together with sets of degree at most 2 in X, form a canonical improvement as in (3W13j . 
If so, this canonical improvement is added to obtain a solution of larger cardinality. 

Again the color-coding technique is exploited to find all structures in polynomial time. 
In comparison, the local improvements in this paper are less general than in the other 
+ e polynomial time approximation by Cygan. This results in the fact that this new 
analysis is simpler than the one in |Cygl3| . 

3.2 Weighted approximations 

This section considers approximation algorithms for weighted k-set packing. In the weighted 
fc-set packing problem, every set S in C is associated with a weight w(S) and the objective 
is to maximise the total weight of the packing rather than its cardinality. 

Also the weighted fc-set packing problem can be solved in polynomial time for k = 2 
|NT99| . so from now on assume k > 3. 

3.2.1 Three close to /^-approximations 

A ^-approximation The first approximation algorithm for weighted k- set packing is 
due to Hochbaum |Hoc83| who achieved an approximation guarantee of k. His algorithm 
first preprocesses the input and then more or less greedily adds sets to the current solution. 

A k — 1 + ^-approximation A weighted k — 1 + ^-approximation algorithm was found 
in [BNR95 ]. They show that the simple greedy algorithm approximates the weighted 
problem within a factor of k , where the greedy approach successively selects the subset 
with the highest weight from all subsets that do not intersect any selected subset so far. 
Using a similar local search analysis as in the unweighted case they are able to improve 
this bound to k — 1 + p 

A k — 1 + e-approximation This result was slightly improved by Arkin and Hassin 
IAH97 I. Their setting is more general in the following sense. They consider a bipartite 
graph G = (U U V, E) where every vertex is associated with a weight. They use the 
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shorthand notation w(X) = Y2 xgx w ( x ) f° r a subset of the vertices X, and write E v 
for the neighbourhood N(v) of some vertex v. Then analogously to |HS89| ] they assume 
that \E V \ < k for all v E V and that any subset R C U of at most t nodes satisfies 
w{R) <w(\J ueR E r ). 

Their main theorem is that < k — 1 + ■}. The k — 1 + ^-approximation from 

[BNR95j is the case where k = t, so this result is slightly more general. Effectively, they 
reached an approximation guarantee of k — 1 + e. 

3.2.2 A ^±2 and a ^-approximation 

The algorithm Chandra and Halldorsson [CH99] were the first to beat this bound and 
obtain an approximation guarantee of 2 fc j~ 2 . They combine the greedy algorithm to find 
an initial solution and local search techniques to improve upon this solution. Essentially 
they show that either the greedy solution is already good, or the local search improves it 
quite well. They reduce the problem to the independent set problem in a k + 1-claw free 
graph G, and their local search finds an improving claw. This is a claw C whose center 
vertex v is in the current solution A in G, and hence the talons Tc of the claw (its other 
vertices forming an independent set) are not in A. They show that using local search to 
find any improving claw leads to an approximation guarantee of 4fc + 2 , while a local search 
for the best local improvement is 2 k ^ 2 -approximate. With the best improvement they 
mean the claw C with the maximum ratio between the sum of the weights of T c and the 
sum of the weights of the neighbours of Tc in A. 

Time complexity To show that the algorithm runs in polynomial time, they need to 
modify it a bit. First they find a solution A using the greedy approach. Then they rescale 
the weight function such that w(A) = kn. Then they keep searching for the best local 
improvement using the weight function |_irj • Now each iteration increases |_irj (A) by at 
least one. Since A is at most a factor of k away from the optimal solution, the number of 
iterations is bounded by k 2 n. Since in every turn they only inspect a polynomial number 
of candidates, the algorithm now runs in polynomial time. 

3.2.3 A close to ^-approximation 

Berman and Krysta IBK03’ improved the approximation guarantee slightly. For every 
k they find the optimal value for a such that any 2 -locally optimal solution with re¬ 
spect to the “misdirected” weight function w a achieves the best approximation guaran¬ 
tee. Surprisingly there are only three distinct values for a that cover all values of k. 
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There is an optimal value of a for k = 3 which yields an approximation guarantee of 
^ ~ 0.66667fc, there is an optimal value for k = 4 which approximates the problem 
within —k ~ 0.65139& and there is a value that is optimal for k > 5, resulting in a 
2” log 3 2 /c ss 0.64576fc-approximation. 

3.2.4 A ^ii-approximation algorithm 

Currently the best approximation algorithm for weighted k -set packing is from Berman 
[BerOOj . also using another objective function w 2 rather than w. Its approximation guar¬ 
antee is Berman dehnes a function charge(rt,u) and searches for minimal claws that 
satisfy some condition on this charge function, called nice claws. He shows that if this 
algorithm terminates it achieves the desired approximation guarantee. To show that the 
algorithm terminates, he shows that every nice claw improves the square of the weight 
function. Similar to the running time analysis of [ CH99j . he shows that the number of iter¬ 
ations where w 2 improves is polynomially bounded. And when there is no more claw that 
improves w 2 , there is no nice claw anymore, and hence the algorithm terminates. Therefore 
it runs in polynomial time and achieves the approximation guarantee ^£1 • Chronologically, 
this result came before the result from the previous subsection [BK03] . 

Chapter [6] dives into the details of this paper and we show a simplified and more 
intuitive proof of the fact that a nice claw improves w 2 . 


3.3 Parameterized complexity 

Background Due to the limited results in the area of approximation algorithms for a 
long time, people started to study the parameterized problem where the cardinality of 
the solution is assumed to be rn. In the fc-set m-packing problem the goal is to find m 
disjoint sets, each of size k. The running time of exact algorithms for this problem can be 
written in the form f(m)poly(n ), moving the exponential dependency to the parameter 
m instead of n. There have been huge improvements on the running time of these exact 
algorithms. This is in spite of the fact that the fc-set m-packing problem is IE[l]-complete 
with respect to the parameter m IDF99| (see Subsection 13.4.31 for details about W[l]- 
completeness). These results contain some algorithms especially designed for the case 
k = 3. Table DPI in Appendix [XI gives an overview of the results on the 3-set m-packing, 
and we refer to Table lM2l in Appendix [A] for the more general k -set m-packing problem. 
Some results also extend to the weighted case, this is mentioned in the last column. We 
follow the convention from parameterized algorithms to let 0 *(f(m )) denote 
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Improvements Some remarks in the last column of these tables require some expla¬ 
nation. Koutis ’ jKou05j original result was an 0 *(time algorithm. In |CLSZ07l 
1LLCS06] it was pointed out that the constants are huge in this approximation, showing 
the bound is at least 0*(32000 3 " 1 ) when k = 3. Like many results in this area, Koutis de- 
randomised his algorithm using the color-coding technique from Alon, Yuster and Zwick 
AYZ95 . In (CLSZ07, ILLCS06 ] a new perfect hashing technique was introduced, with 
which Koutis’ result could be improved to 0*(25.6 mfc ). A similar result is true for the 
deterministic algorithm by Fellows et al [FKN + 08] , Its original bound was exp(0(mk)), 
which was actually 0*((12.7D) 3m ) for some D > 10.4 (when k = 3). This is improved to 
0*(13.78 mfc ) using the new perfect hash function. 

The first deterministic result of the form 0*(g(m , k)) for k -set ?n-packing is due to Jia, 
Zhang and Chen |JZC04| . Currently the best complexity result (randomised) result for 
k -set m-packing is 0*(f(m,k)) by Bjorklund et al {BHKKlOj . This function is not very 
readable or insightful, but behaves well for small k. For example, when k = 3, we have 
0*(/(3, m)) « 0*(1.493 3m ), which is the result mentioned in Table [ATI The function is 
strictly smaller than 0 *{2 mk ). 


Hardness Some of the fastest parameterized algorithms rely on group algebra theory. 
A variable is introduced for every element and a subset is the product of its variables. A 
packing then corresponds to a multilinear polynomial of degree mk. Koutis and Williams 
|KW09| showed that detecting such a polynomial cannot be done in their model in time 
faster than 0*(2 mk ). These results still hold if the color-coding method is used ( jAYZ95j ) 
or the randomised divide-and-conquer approach ( [CKL + 09[ Cl.S7.07 ). 

This is the only result on the limit of the time complexity of these parameterized 
algorithms. There are more hardness results known for the approximation algorithms of 
non-parameterized k -set packing. The next section captures an outline of these hardness 
results. 


3.4 Hardness results 

Subsection 13.4.11 considers some results on the general set packing problem. Subsections 
13.4.21 13.4.31 and 13.4.41 consider hardness results specifically for k -set packing. These are 
respectively a result on the hardness of approximation, on the non-existence of certain 
algorithms and on the limits of local search techniques for this problem. 
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Covering-packing dualities 

Minimum set cover Maximum set packing 
Minimum vertex cover Maximum matching 
Minimum edge cover Maximum independent set 

Table 3.1: Every horizontal pair of problems are each other’s LP-duals. 

3.4.1 Hardness of set packing 

Set packing is one of the standard packing problems, which are closely related to covering 
problems. Table l3Tl lists some of these problems. Set packing is the most general of these 
packing problems, together with its LP-dual minimum set cover. We refer to jADP80 l for 
a survey of these problems. 

The following results on the hardness of the problems also apply to the set packing 
problem. Arora et al showed that, unless V = A fV, the vertex cover does not admit 
a polynomial time approximation scheme, even on bounded degree graphs |ALM + 98 . 
Moreover, also assuming that V ^ J\TV, it has been shown that there is no constant-ratio 
polynomial time approximation scheme for independent set ( GJ79, P&821 . Hastad proved 
that set packing cannot be approximated within n 1_£ , where n is the number of sets, 
unless A fV = ZVV [Uas96] . On the positive side, set packing can be approximated within 
a factor of \Z~N [HKTOO ] (recall that N is the number of elements in IX). Hastad’s result 
also implies this is the best possible assuming MV / ZVV. 

3.4.2 Hardness of approximation 

As noted in Subsection 13.1.31 currently the best approximation guarantee for unweighted 
k- set packing is pp + e. Hazan, Safra and Schwarz [HSS06J showed the following hard¬ 
ness of approximation result. There is still a gap to bridge between the currently best 
approximation guarantee and this bound. 

Theorem 9. f\HSS06 1/ ) It is NP-hard to approximate k-set packing in polynomial time 
within a factor of O • 

Here is a rough outline of their argument. Define gap -P-[a,b] to be the following 
decision problem: decide on an instance of the decision problem P whether there exists 
a fractional solution of size at least b or whether every solution of the given instance is 
of fractional size smaller than a. Then if gap -P-[a,b\ is NP-hard, so is approximating P 
within a factor smaller than -. 

a 
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Define MAX-3-LIN-q 1 as the optimization problem where a set of linear equations over 
GF(q) is given, each depending on 3 variables, and one needs to find an assignment 
maximising the number of satisfied equations. Hastad [HasOlj proved that gap-MAX-3- 
LIN-g-[i + e, 1 — e] is NP-hard, which was a central result in the theory of hardness of 


approximation. 

In 1 1 ISSbiil they provide a polynomial time reduction from MAX-.3-LIN -<7 to ^-uniform 
hypergraph matching (i.e. to fc-set packing). They use what they call a (q, <5)-Hyper- 
Graph-Edge-Disperser as a gadget (see [RTSOO] for background on dispersers and extrac¬ 
tors). They construct such a gadget for every variable occurring in the equations. The set 
of all these gadgets is then the set of vertices of a hypergraph H. By a clever construction 
of the hyperedges they relate the satisfied equations to a packing in this hypergraph. This 
enables them to show that gap-fc-SP-[4r-|-£, ^r — 1] is NP-hard. As they have k = 0(glog q ), 
they find a bound for the inapproximability factor for k- set packing of D(j-^x). 


3.4.3 Non-existence of certain algorithms 

There are also some other hardness results known, more specifically on the existence of 
certain algorithms or on the limits of what is achievable using local search techniques. 
Cygan |Cygl3| proved the following. 


Theorem 10. (JCygl3[ Theorem 1.1]) It is W[l]-hard to search the whole space of im¬ 
proving sets of size r efficiently. 

More formally, unless FPT = W[l], there is no f{r)poly{n) time algorithm that given 
a family C C 2 U of sets of size 3 and a disjoint subfamily A C C either finds a bigger disjoint 
family B C C or verifies that there is no disjoint family B C C such that |M\R| + |R\M| < r. 


Fixed parameter tractability This theorem requires some explanation (see e.g. [CHKX0'5l 
!DF99j ). FPT is the set of fixed parameter tractable problems, which are problems with 
input size n that can be solved in time f(k)n°^ for some function f. FPT thus classi¬ 
fies problems according to multiple parameters rather than a single parameter, where it is 
crucial that functions /(n, k ) like n k are not allowed. If the value of k is fixed, the problem 
is said to be parameterized, which is the setting of Subsection 13.31 with parameter m. 


VU-hierarchy of FPT The IT- hierarchy (Jt>o IU [t] has been introduced to formalise 
the level of intractability of problems. We have FPT = VU[0] and W[i] C W[j] whenever 
i < j- We do not go into the details and refer the interested reader to [CHKX051IDF99] , 
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Theorem 1101 assumes FPT A W[l], which is a widely believed assumption. In par¬ 
ticular, this assumption is equivalent to the famous Exponential Time Hypothesis (ETH) 
jIP99j . If FPT = W[l] then the ETH fails and vice versa |Cygl3[[CHKX05j . When a prob¬ 
lem is IU[l]-complete (i.e. not solvable in polynomial time unless FPT = W[t\, here for 
t = 1), this is strong evidence that the problem is probably not fixed parameter tractable. 
For example, the clique problem and the independent set problem are W [l]-complete and 
the dominating set problem and the set cover problem are IT [2]-complete. 

The assumption in Theorem [TO] is thus plausible, and if this is true then there is no 
polynomial time algorithm which searches the whole space of size r improving sets. 


3.4.4 Limits of local search 


Next to this result on the existence of such an algorithm, there is also a lower bound on 
the approximation guarantee that algorithms based on local search techniques can achieve. 
Sviridenko and Ward [SW13) proved the following. 

Theorem 11. (fMM Theorem, 6.1]) The locality gap of a t-locally optimal solution is 
at least even when t is allowed to grow on the order of n. 

More formally, let c = and suppose that t < cn for all sufficiently large n. Then 
there exist 2 pairwise disjoint collections of k-sets A and B with |^4| = 3n and \B\ = kn 
such that any collection of a <t sets in B intersects with at least a sets in A. 


This result shows that even local search algorithms that are allowed to examine some 
exponential number of possible improvements at each stage cannot achieve an approxima¬ 
tion guarantee better than |. This suggests that local search algorithms, which currently 
achieve an approximation guarantee of pp- + e, are not a good approach to beat the 
approximation guarantee much further. Ftirer and Yu |FY13j extended this result. 


Theorem 12. (IFYlSi Theorem 7]) There is an instance for k-set packing with locality 
gap ppL such that there is no local improvement of size up to 0 (ns). 

More formally, for any t < ° there exist two disjoint collections of k-sets A and 

B with |^4| = 3n and \B\ = (k + l)n such that any collection of t sets in A intersects with 
at least t sets in B. 


So local search has reached its limits for all practical purposes. To achieve an approx¬ 
imation guarantee beating the order of other approaches to the problem are necessary 
to consider. Another approach one could take is the LP and SDP relaxations for fc-set 
packing. The next chapters treats these two approaches and show improved bounds on 
their integrality gap. 
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Chapter 4 

LP formulation 


This chapter treats the current results on the LP relaxation of k -set packing. Section 14.11 
starts with the results for the standard LP relaxation. Then Section 14.21 shows how to 
strengthen this LP and we prove a new bound for the integrality gap of this strengthened 
LP. Finally Section [4.31 presents how to decrease the size of this LP to a polynomial size. 

4.1 The standard LP 

LP-formulation View the set packing problem as the hypergraph matching problem like 
in Subsection 12.2.11 Introduce a variable x e for every hyperedge e that indicates whether 
e is included in the matching or not. The objective is to maximise x e such that every 

e&E 

vertex is covered only once. For convenience we introduce the following notation. 

X ( F ) ■= Yh x e- 

e€F 

Hence the natural linear program looks like the following, denoted by (ILPD . As before, 
5(v) denotes the set of hyper edges incident to v. 

max x(E ) 

s.t. x(5(u))<l, VvGk (LP) 

0 < x e < 1, Ve € E 

Results Fiiredi, Kahn and Seymour |FKS93] have shown that the integrality gap of 
m equals k — 1 + ^ for fc-uniform hypergraphs. They also show that in the case of a 
fc-partite hypergraph the integrality gap equals k— 1 , but both proofs are non-algorithmic. 
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Figure 4.1: The Fano plane. A hyperedge is represented by a line connecting three vertices. 

Chan and Lau [ CLIO] gave an algorithmic proof of these facts and showed that the bounds 
are tight. Both results also extend to the weighted case. 

Tight example As a tight example for the non-fc-partite case, consider the projective 
plane of order k— 1. This is a hypergraph that is fc-uniform (every hyperedge has cardinality 
k), L-regular (every vertex has degree k), in which every pair of hyperedges intersects in 
one vertex, and in which every pair of vertices is contained in exactly one hyperedge. 
Equivalently it is the Steiner system S(2,k,k 2 — k + 1). A projective plane of order k 
exists if k is a prime power and the conjecture that this is also a sufficient condition is 
a long standing open question. The projective plane of order 2 (thus corresponding to 
3-set packing) is the well-known Fano plane. Figure 14.11 depicts a Fano plane, where a 
hyperedge is represented by a line connecting three vertices. 

To see that the projective plane of order k — 1 is a tight example, note that the integral 
solution to this L-set packing instance equals 1 as every hyperedge intersects every other 
hyperedge. But fractionally, it is possible to set x e = 4 for every hyperedge because the 
hypergraph is ^-regular. This is a feasible solution to m and since the hypergraph has 
k 2 — k + 1 hyperedges the integrality gap equals \{k 2 — k + 1) = k — 1 + p 

4.2 Strengthening the LP formulation 

4.2.1 The intersecting family LP 

The LP formulation can be strengthened by adding extra local constraints. Call a family 
of hyperedges intersecting if every two of them overlap in at least one vertex. In the 
conflict graph of an instance, an intersecting family F would be a clique F. Obviously, 
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from every intersecting family only one hyperedge can be picked. For every intersecting 
family thus an extra constraint can be added to obtain the following strengthened LP. Let 
JC denote the collection of all intersecting families. 

max x(E) 

s.t. x(<5(u))<l, Mv € V 

(Intersecting family LP) 

x (K) < 1, MK G /C 
0 < x e < 1, Me € E 

This is called the interesting family LP. Note that this LP in general has exponentially 
many constraints and hence it is not solvable in polynomial time. In Section 14.31 we will 
show that for each k the LP can be rewritten into an LP with a number of constraints that 
is polynomial in n. In [ CLIO] , they proved the following theorem about this intersecting 
family LP. 

Theorem 13. (\CLlCk Theorem 4-1]) The ratio between any LP solution to the intersect¬ 
ing family LP and any 2-locally optimal solution is at most Thus the integrality gap 
of the intersecting family LP is at most . 

We omit the proof, because we will show an improved bound in Theorem 1171 In order 
to proof this, the lemmas from HI- 9 1 treated in the next subsections are necessary. 


4.2.2 A lemma on multigraphs 

Lemma 14. (\BF0J t \ Lemma 3.1]) Assume that every vertex in a mu/ti^rap/0 G = (V,E) 
has degree at least 3. Then every vertex v € V belongs to a connected induced subgraph 
G[X] with strictly more edges than vertices, of at most 41og 2 n — 1 vertices. 


Proof. This is a slightly more detailed proof than in [BF94 j. Let G = (V,E) be a multi¬ 
graph where every vertex has degree at least 3 and let v E V be an arbitrary vertex. 
Consider a breadth-first search tree T of G rooted at v. The distance of a vertex in T is 
the length of the shortest path from the vertex to v. 

Let m > 0 and suppose every vertex in T at a distance less than rri has at least two 
children. Then T has at least 2 m vertices. Since T has n vertices, m < log 2 n. Since v 
has degree at least 3, in fact m < log 2 n. This implies that there must be a vertex u at a 
distance of at most log 2 n — 1 having at most one child. 


X A multigraph is a graph where multiple edges and loops are allowed. 
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Since u has at least degree 3 but at most one child, one of the edges incident to u in 
T must be a cross edge, a loop or a multiple edge. Let’s denote this edge by e = {u, w}, 
where possibly w = u. Then the tree paths from v to u and from v to w together with 
the edge e form an induced subgraph G[Y] of at least as many edges as vertices, and this 
induced subgraph has at most 21og 2 n vertices. 

If the number of edges in G\Y] is already strictly larger than the number of vertices 
in G[y], then the proof is finished. Otherwise the number of edges equals the number of 
vertices. In this case, shrink G[Y] to one single vertex y. In this modified graph G', every 
vertex still has degree at least 3. Therefore G’ contains another induced subgraph induced 
by some Z U {y} with at least as many edges as vertices, of size at most 2 log 2 n. But 
then G\Y] U G[Z] contains strictly more edges than vertices in G and its size is at most 
41og 2 n — 1, because y is not a vertex in G. So G[Y U Z\ is an induced subgraph of G that 
satisfies the required properties, as G[Y U Z\ has at least as many edges as G[Y]UG[Z]. □ 

4.2.3 Another lemma on multigraphs 

Using this lemma following lemma can be proved, which is the one really needed for 
Theorem QU 

Lemma 15. ( \BF94\ Lemma 3.2]) For any integer h > 1, any undirected multigraph 
G = (V,E) with \E\ > ^p|V| contains a set X of less than 4/ilog 2 n vertices, such that 
in G[X] there are more edges than vertices. 

Proof. Also this is a slightly more detailed proof than in [BF94j . Let h be any posi¬ 
tive integer. For convenience, for any undirected multigraph H = (V(H),E(H)) denote 
\V(H)\ = Vh, \E(H)\ = Eh- H is said to satisfy Property (*) if Ejj > Given a 

set of vertices U in a multigraph H , denote Vjj := Vh[u] > Eu := ^H[u]■ U is said to satisfy 
Property (*) if its induced subgraph H[U) satisfies Property (*). 

Let G = ( V,E) be an undirected multigraph that satisfies Property (*), so \E\ > 
^-\V\. We need to show that T contains a set X of less than 4/ilog 2 n vertices such that 
in G[X] there are more edges than vertices. 

Let JJ C U be the smallest set of vertices in G that satisfies Property (*). Because U 
is a minimal set, G[U] cannot have a vertex u of degree 1: otherwise U \ {u} would be a 
smaller set satisfying Property (*). 

Now consider a maximal chain of degree 2 vertices C in G[U] and denote the two 
vertices adjacent to C, its endpoints in G[U] \ C, by x and y. First we show that C 
has less than h vertices. Assume the contrary and suppose C has exactly h vertices. 
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We claim that U \ C is a smaller set of vertices that satisfies Property (*) which is a 
contradiction. Note that Ey\c = Ejj — h — 1 because G\C] contains h — 1 edges and is 
connected to the rest of U by 2 other edges. Also Vjj\c = Vjj — h. So we need to show 
that Ejj — h — 1 > ^4-^- (Vjj — h). But this is true: subtract h + 1 on both sides from the 
known fact that Eu > to get to this equation. So if C contains h vertices, U is not 

the smallest set of vertices satisfying Property (*), which is a contradiction. Note that the 
argument still holds if C has p > h vertices: then on the left-hand side p + 1 is subtracted 
while on the right-hand side ^-p = P + > P + 1 is subtracted, so the left-hand side is 

indeed still larger than the right-hand side. We conclude that C has strictly less than h 
vertices. 

Now modify the graph G[U]. Replace every maximal chain of degree 2 vertices C 
by a single edge connecting its endpoints x and y. Since any two such chains do not 
intersect (except possibly at their endpoints), every such chain is replaced independently 
from another. Denote the resulting graph, obtained from G[U] by contracting all such 
chains, by G'. Then G' does not have any vertices of degree 2 as they are all contracted. 

Therefore all vertices in G' have degree at least 3, and Lemma [IT] applies. Hence G' 
contains some induced subgraph G' [A" 7 ] of size m < 4 log 2 n — 1 with strictly more edges 
than vertices. Now select m + 1 edges in G'[X'\ and replace the edges that were chains in 
G[U] by their respective chains of degree 2 vertices. Denote the resulting graph, obtained 
from m + 1 edges from G'[X'] by expanding the chains, by G[X\ (note that G[X] is indeed 
an induced subgraph of G so this notation is valid). 

Note that in the expanding of the chains, at most m+ 1 chains are expanded and every 
chain has size less than h. So G[X] has in total at most m + (m + l)(h — 1) = (m + l)h — 1 
vertices. As m < 41og 2 n — 1 we see that Vx = (m + 1 )h — 1 < 4/ilog 2 n vertices and the 
proof is complete. □ 

The following lemma is an immediate consequence. 

Lemma 16. (Special case of Lemma \15\) Let e > 0. If in an undirected multigraph 
G = (V,E) there is no subset of vertices X of size at most 4(1 + i)log 2 n such that in 
G[X] there are more edges than vertices, then \E\ < (1 +e)|V|. 

4.2.4 A new bound on the integrality gap 

Using this last lemma, the following improved bound on the integrality gap of the inter¬ 
secting family LP can be established. 
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Theorem 17. Let e > 0. The ratio between any LP solution to the intersecting family LP 
and any 4(1 + -) log 2 n-locally optimal solution is at most | + 1 + e. Thus the integrality 
gap of the intersecting family LP is at most | + 1 + e. [NOTE: see page Hi] 


Proof. Let M be a 4(1 + i) log 2 n-locally optimal matching. Let x be a feasible solution to 
the intersecting family LP, and let T be the set of hyperedges with x e > 0. To prove the 
theorem it suffices to prove that x(E) < (| + 1 + e) |M|. Denote by F\, T 2 and .F 34 . the 
subsets of T in which every hyperedge in intersects exactly one, exactly two or at least 
three hyperedges in M, respectively. 

Note that M is in particular a 1-local and a 2-locally optimal matching. Since M is 
a 1 -locally optimal matching, each hyperedge e in E intersects at least one hyperedge in 
M: otherwise M U {e} would be a larger matching. Hence T = E\ U E 2 U We will 
now proceed to bound x(J-i) and x(J r 2 ) in terms of |M|. 

Consider a hyperedge e in M and define E\{e) := {/ £ E\ \ f n e / 0}. Then E\ (e) 
is an intersecting family. For suppose this is false, then there are two disjoint hyperedges 
/ 1 , / 2 in J -\. These are disjoint from all other hyperedges in M \ {e} because / 1 , / 2 € F\. 
Therefore M — e+/i + / 2 is a larger matching than M, but this is in contradiction with the 
fact that M is a 2-locally optimal matching. Hence J~\ (e) is an intersecting family. Thus 
by the intersecting family constraint ^(^(e)) < 1. Now summing over all hyperedges 
e € M yields 

®(^i) < |M|. (4.1) 

For the purpose of the analysis of the bound on x(J- 2 ), consider an auxiliary multigraph 
H with a vertex for every set in M. Two vertices mi, m 2 € M are adjacent in H if and 
only if there is a set in J -2 that intersects both sets corresponding to mi and m 2 . Note that 
H consists of exactly \M\ vertices and \J~ 2 \ edges, potentially some of them are parallel. 
By assumption, M cannot be improved by a set of size at most 4(1 + i)log 2 ?r. As an 
induced subgraph of H with more edges than vertices constitutes an improving set, H 
cannot contain an induced subgraph of size at most 4(1 + 7 ) log 2 n with more edges than 
vertices. Hence, by Lemma 1161 we infer that \E(H)\ < \V(P[)\(1 + e). Consequently, 
IJ^I < (1 +e)\M\. As x(.F 2 ) < || it follows that 

x{E 2 )<(1 + s)\M\. (4.2) 


Now that x{F\) and x(J r 2 ) are upper bounded in terms of \M\, the result follows easily. 
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Note that there are k\M\ vertices in M. Then the degree constraint yields 

k\M\ > x(J'i) + 2.t(J 7 2 ) + 3.t(J 7 3 + ) 

= 3x(T) — x(J- 2 ) — 2x(T\). 

Now plug in the bound from (|4.1I) and (14.21) to find 

k\M\ > 3x{E) - (1 + e)\M\ — 2\M\, 


which can be rewritten as 

x{F) < k + 3 3 + £ \M\ < Q + 1 + e) \M\. 


□ 


4.3 A polynomially sized LP 


A new bound on the integrality gap of the intersecting family LP has now been established, 
but the LP might have exponentially many constraints and thus not solvable in polynomial 
time. This section treats the result from |CL10j that for constant k only a polynomial 
number of constraints can be added to the standard LP formulation such that every 
intersecting family has a fractional value of at most 1. To this end the definition of a 
kernel is needed. Intuitively, for an intersecting family K its kernel is a subset U of the 
vertices covered by K such that all hyperedges restricted to U still form an intersecting 
family. More formally, for each hyperedge e define ejj = e n U, and for a collection of 
hyperedges K define Kjj = {e u \ e € K}. Then U is a kernel for an intersecting family K 
if Kjj is an intersecting family. 

We proceed with the following result from }CK64 j. 


Lemma 18. (ICK 64 J) For every k there exists an f{k ) such that for every k-uniform 
intersecting family K there is a kernel S of cardinality at most f(k). 


The point of this lemma that will be exploited is that the size of this kernel f(k ) is 
independent of n or the number of vertices of the hypergraph. This was a well-studied 
topic and the interested reader may read [AF871IEM721IEL731IFF861 IFra781 ITuz85| . Now 
we will repeat the result from [CLIO ] that shows how to apply this lemma to prove the 
following theorem, with our improved bound on the integrality gap. 


Theorem 19. Let e > 0. There is a polynomially sized LP for k-set packing with inte¬ 
grality gap at most | + 1 + e. [NOTE: see page in] 
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Proof. We will prove it is possible to find a polynomially sized LP that still captures all 
the constraints of the intersecting family LP. This immediately implies the LP has the 
claimed integrality gap by Theorem [T7] We follow the proof from fCLIOj . 

Let G = (V, E) be the hypergraph and create a variable xu for every subset U C V 
that is a subset of some hyperedge e € E. To enforce that xjj represents the fractional 
value of U, add the constraint xjj = Y! e ' 3 U x e- Each new variable U is a subset of some 
hyperedges in G , and U is said to be contained in a subset S if JJ C S. 

Now enumerate all possible subsets S C V with 15*1 < f(k). For each such subset S, 
enumerate all possible intersecting families Kg formed by the new variables contained in 
S. For each such intersecting family Kg write the following kernel constraint. 

^ x v < 1. 

UeK s 

There are Yli=i (1) — n/( fc )+! possible kernels. For each kernel S with | S'! = t there are 
at most 2* new variables contained in S , because there are at most 2 4 subsets of S. Hence 
there are at most 2 2 * intersecting families Kg induced in S, because there are at most 2 2 * 
hypergraphs in S. Every such intersecting family corresponds to one constraint, so there 
are no more than n^ k ' >+1 2 2f(k) kernel constraints. So when k is a constant this is a number 
polynomial in n. 

By Lemma [18] each intersecting family has a kernel constraint and hence each inter¬ 
secting family has at most a fractional value of 1. So indeed, the intersecting family LP 
can be rewritten into a polynomially sized LP for every constant k. □ 





33 


Chapter 5 

SDP formulation 


The first two sections treat background on semidefinite programming and the Lovasz Theta 
function. These concepts are needed for the proof in Section T5.31 of the theorem that there 
exists a polynomially sized SDP with the improved integrality gap from Theorem 1 171 

5.1 Background on semidefinite programming 

A semidefinite program is a more general form of a linear program. General references for 
a thorough background about SDPs are [BTNOll lFre04l lLov95l IPis06l IVB94] , 

In a linear program, the objective is to maximise a linear function over a convex 
polyhedron. In a semidefinite program the numbers are substituted by vectors and the dot 
product of two vectors is used instead of the multiplication of two numbers. A semidefinite 
program can be written in the following form. 

T 

max c x 

s.t. x\A\ + ... + x n A n — By 0 

Here x € M n is the vector of decision variables one needs to assign values to in order to 
maximise the inner product c T x with the given vector c E M n . A\,... ,A n ,B E S m are 
given symmetric m x m matrices, so one can think of X := x\A\ + ... + x n A n — B as 
a matrix whose entries are linear functions over the variables x*. The constraint X y 0 
means X needs to be positive semidefinite, which is equivalent to y T Xy being nonnegative 
for all y E K m or to X having only nonnegative eigenvalues. When X y 0 for some x E 
we say x is a feasible solution. Since both the objective function and the constraints are 
convex in x, a semidefinite program is a convex optimization problem. In contrast to a 
linear program, its feasible region is in general not a a polyhedron. 
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5.2 Background on the Lovasz Theta function 

This section gives some background on the famous Lovasz Theta function introduced in 
[Lov79j . We introduce the Lovasz Theta function via orthogonal representations. In order 
to do that some background about the stable set polytope is first given in Subsection 
15.2.11 Subsection 15.2.21 will talk about orthogonal representations and then the Lovasz 
Theta function is introduced in Subsection 15.2.31 General references for this section are 
[BTNOll IGLS881 IKnu941 ILov95j . 

5.2.1 The stable set poly tope 

Definition Stable set is another name for an independent set. Given a graph G = (V. E ), 
a(G) denotes the size of the maximum independent set in G. For every subset of the 
vertices S C V its incidence vector is denoted by x S £ i.e. for all i £ f, xf = 1 if 
i € S and xf = 0 otherwise. Now define the stable set polytope STAB(G') as follows. 

STAB(G) = conv.hull(x 5 £ M v \ S is an independent set in G) 

Properties So STAB(G) is the smallest convex set in BT containing the incidence vec¬ 
tors of all independent sets. Since all extreme points of this polytope are 0,1-vectors, there 
is a system of linear inequalities describing this convex set. Theoretically it is possible to 
find a(G) by optimising the linear objective function Yli x i over STAB(G). However, the 
number of constraints is generally exponential in |Vj so this is not an efficient approach to 
find a(G), which should be expected as determining a(G) is NP-hard. What one can do, 
however, is to find extra inequalities for the stable set polytope and find upper bounds for 
a{G). 

The clique constrained stable set polytope With the intersecting family LP in 
mind it is natural to start with the following inequalities for x € . 

Xi > 0 i € V, (5-1) 

Xi <1 for all cliques Q in G. (5-2) 

i&V{Q) 

Now define the clique constrained stable set polytope as follows. 

QSTAB(G) = conv.hull(x € M' | Constraints (15.11) and (15.21) hold). 
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Now any independent set in G corresponds to an integral vertex in QSTAB(G) and vice 
versa. The clique constrained stable set polytope is the first approximation of the stable 
set polytope that is considered here, but to formally define the Lovasz Theta function 
another polytope is introduced in the next subsection. 


5.2.2 Orthogonal representations 

Definition This subsection introduces the Lovasz Theta function via orthogonal repre¬ 
sentations. Let G = (V. E) be a graph and let E = {{i,j} € V x V \ {i,j} 0 E} be the 
complement of E. Formally, an orthogonal representation of G is a mapping (labeling) 
u : V — )• W 1 for some d such that ufuj = 0 for all {//', j} € E. In other words we need to 
assign a vector u v to every vertex v such that the vectors of any two non-adjacent vertices 
are perpendicular to each other. Such a mapping always exists, in fact, there are two 
trivial mappings: map all vertices to 0, or map the vectors to a set of mutually orthogonal 
vectors in R 1 . 


Orthogonal constrained stable set polytope An orthogonal representation (tq \ 
i € V) with m € R rf is called orthonormal when every vector has unit length, i.e. when 
||tq|| = 1 for all i E V. Let c be some vector in R d with ||c|| = 1 (for example, take c = ei). 
Then for any stable set S C V its vectors {ui \ i E S} are mutually orthonormal as the 
vertices are non-adjacent, and hence 

E (^) 2 - L 

ies 

This is true because the left-hand side is the squared length projection of c onto the 
subspace spanned by the iq. The length of this projection is at most the length of c which 
is 1. In fact, note that ^2 i£V (c T ui) 2 x s = Ui) 2 , which yields that the following 

inequality holds for the incidence vector x S °f an Y stable set S C V. It is called the 
orthogonality constraint. 

J2(c T Ui) 2 Xi < 1 . ( 5 . 3 ) 

i£V 

Similar like before, we can now define the orthogonal constrained stable set polytope as 
follows. 


TSTAB(G) = conv.hull(x € R v | Constraints (15.11) and (15.31) hold). 
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5.2.3 The Lovasz Theta function 

The most interesting property of TSTAB(G) is the fact that one can optimise linear 
functions over it in polynomial time |GLS88i Theorem 9.3.30]. We can now succinctly 
define the Lovasz Theta function. 

0(G) = max j^x,; | x € TSTAB(G) 

The following is equivalent by writing out the definitions. Let ONR denote an orthonormal 
representation. Denote the following LP by (10-LPI) . 

0(G) = max ^2 Xi 
i 

s.t. ^(c T rtj) 2 Xj <1 Vc: ||c|| = 1 VONRjrtj} (0-LP) 

i&V 

Xi > 0 Vi € V 



There are a lot of alternative and equivalent definitions for the Lovasz Theta func¬ 
tion. The semidefinite program for k -set packing uses the following equivalent definition, 
known in the literature as 63 (G). An orthogonal representation {6j} is called normalised 
if Yli \\bi \\ 2 = 1, and define G = (V,E). 

163 (G) = max < b u b v \ b is a normalised orthogonal representation of G 

V u,v 



5.3 An SDP for k- set packing 


This section contains the main theorem of this chapter. As before, view fc-set packing as 
the independent set problem in a k + 1-claw free graph and consider the following clique 


LP. 


max 



s.t. x € QSTAB(G) 


(Clique LP) 


This is optimising the size of an independent set over the clique constrained stable set 
polytope. 


Lemma 20. (Clique LP) is equivalent to the intersecting family LP of Chapter [7} 


Proof. The nonnegativity constraints x* > 0 for (Clique LP) obviously match the same 
constraints in the intersecting family LP. The clique constraints YlieQ Xi — 1 f° r cliques 
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of size 1 imply the bound X; L < 1. Evidently they also imply the intersecting family 
constraints x(K ) < 1 for Q = K. The constraints that x(5(v)) < 1 are also implied by the 
clique constraints: all hyperedges covering element v form a clique in the conflict graph. 
The other way around is similar. □ 

Note that replacing QSTAB(G) by TSTAB(G) yields i?(G). These are related as 
follows. 


Lemma 21. (ICL 10 1 Lemma 4-3]) Any feasible solution to (1V-LPI) is a feasible solution 
to ( Clique IT 7 ] ) . 


Proof. It suffices to show that the orthogonality constraints (15.31) imply the clique con¬ 
straints (15.21) . Let Q be a clique in G and map all vertices of Q to c and all other vertices 
to mutually orthogonal vectors that are also orthogonal to c. Then the orthogonality 
constraint for Q implies its clique constraint. □ 


In other words, for every graph G 


STAB(G) C TSTAB(G) C QSTAB(G). 


Finally all linear and semidefinite programs can be linked. 

Lemma 22. 113 (G) is a stronger relaxation than the intersecting family LP. 

Proof. By Lemma I2T1 (Id-LPI) is a stronger relaxation than the clique LP. Hence by Lemma 
1201 fld-LPI) is also stronger than the intersecting family LP. (1V-LPI) is equivalent to il(G), 
which is equivalent to $ 3 (£7). Hence 113 (G) is a stronger relaxation than the intersecting 
family LP. □ 


113 (G) can be written as follows. 


■$3 (G) = max 


UiU i 

i,j&V 


s.t. UiUj = 0, V(i,j) € E 

n 

i =1 

Ui € R d , Vi € V 


(03-LP) 


This is a semidefinite program which is a stronger relaxation than the intersecting family 
LP. Then by Lemma [22] and Theorem 1 171 the following is true. 
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Theorem 23. Let £ 0. ( ]L;^- LP[ ) is an SDP relaxation for k-set packing with integrality 

gap at most | + 1 + e. [NOTE: see page Hi] 

This is Theorem 1.5 from [ CLIO ] with our improved bound on the integrality gap. 
Since this semidehnite program has a polynomial size, the following theorem is also true, 
similar to Theorem m 

Theorem 24. Let e > 0. There is a polynomially sized SDP for k-set packing with 
integrality gap at most | + 1 + e. [NOTE: see page Hi] 
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Weighted approximation 

In this chapter we give a simplified proof of the main lemma of Berman’s paper |BerOO| 
containing the currently best approximation algorithm for weighted k -set packing with 
approximation guarantee . First the necessary terminology is introduced in Section 
16.11 Then the algorithm is discussed in Section 16.21 and Section 16.31 provides a new proof 
of the main lemma. 


6.1 Terminology 

Claws Consider the following setting from jBerOOj for this chapter. For a graph, define 
a k -claw C as a subgraph isomorphic to Ki : k, the complete bipartite graph on 1 and k 
vertices. For convenience, define a 1-claw to be a singleton set C with Tq = C. A claw is a 
k -claw for some k. Define the single vertex connected to all other vertices of the claw to be 
the center Zq- The other vertices of the claw (forming an independent set by definition) 
are called the talons Tc of the claw. A k -claw has k talons and one center vertex. Write 
C = Zq U Tc for a claw C with center vertex Zq and talons Tc- 


Approximation guarantee Let G = (V,E) be a k -claw free graph with a weight w(v) 
for every vertex v € V. The main theorem of jBerOOj is a ^-approximation algorithm for 
maximum independent set in such a graph. This yields a ^^--approximation algorithm for 
weighted k -set packing because any packing corresponds to an independent set in a k + 1- 
claw free graph. The algorithm searches for claws satisfying certain properties and then 
adds the talons of the claw to the current independent set A and removes the neighbours 
of the talons in A. 
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Notation We will use the following notational conventions. Define for a vertex v E V 
its open neighbourhood (or just its neighbourhood) N(v) = {w G b {v, w} E E} and its 
closed neighbourhood N[v] = N(v) U {u}. For a subset of the vertices W C V define its 
closed neighbourhood N[W] = IV [w;] and write N(W) = IV [W] \ W for the (open) 

neighbourhood of W. 

Definition 25 . For two given subsets of the vertices U and A write N(U,A ) = N(U)nA, 
i.e. the neighbourhood of U in A. 

N(U, A) is called the A-neighbourhood of U and we refer to the vertices in N(U, A) as 
the A-neighbours of U. Write N(u,A ) for N({u}, A) for some vertex u and some subset 
of the vertices A. 

Definition 26 . By n(u,A), denote a vertex v E N(u,A ) with maximum weight. 

If this is not uniquely defined, simply choose a random vertex of the different possibil¬ 
ities. 

For convenience we introduce the notation w(U) = Y1 v gU w ( v ) f° r some subset of the 
vertices U C V. Even shorter, we write the following. 

Definition 27 . w(U,A ) = w(N(U,A )) and w(u,A ) = w(N(u,A)). 

These are the sums of the weights of the A-neighbours of a subset of the vertices U 
respectively one vertex u. These notations extend to different weight functions such as 
w 2 , in particular note that w 2 (U,A) = Y1 v &n(u A) w2 ( v ) / {w{U, A)) 2 . Now define the 
following function as in (BerOPj . 

I w(u) — \w(u,A), if v = n(u,A); 
charge{u , v) = < 

I 0, otherwise. 

Now define the following. 

Definition 28 . Let A be an independent set in a graph G = (V. E) . Define a good claw 
C = Zc U Tc to be a claw satisfying either of the following properties. 

1. N(Tq,A) = 0, i.e. adding Tc to A to obtain another independent set does not require 
the removal of any sets in A; or 

2. The center vertex Z c is in A and J2 U £T c °harge(u,v ) > \w(v). 

A claw C is called a nice claw if it is a minimal set forming a good claw, i.e. if there is 
no strict subset of C forming a smaller good claw. 
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6.2 Two algorithms joining forces 

6.2.1 The algorithms Squarelmp and WishfulThinking 

In this setting, define the following algorithm: 

SquareImp 

A^0 

While there exists a claw C such that Tc improves w 2 {A) 

AUT C \N(T C ,A) 

Now define the following algorithm: 

WishfulThinking 

A^0 

While there exists a nice claw C 
A^ AUT C \N(T C ,A) 

These two algorithms can now be linked in the following way. 

1. Every nice claw improves w 2 (A), so a run of WishfulThinking forms the initial 
part of a run of SquareImp. See Section [6T3l 

2. Consequently, WishfulThinking cannot make more iterations than SquareImp. 

3. When SquareImp terminates it yields an independent set A for which no claw im¬ 
proves w 2 (A). Hence there is no more nice claw, so WishfulThinking terminates. 

4. If WishfulThinking terminates, its approximation guarantee is |. See Subsection 

16X21 

The proof of the approximation guarantee is repeated in Subsection 16.2.21 to get some 
insight into the non-intuitive definitions of the charge function and good claws. The main 
lemma is the fact that every nice claw improves w 2 (A), for which we give a simplified 
proof in Section 16.31 


6.2.2 The approximation guarantee 

We repeat the following proof of (BerOOj . 


Lemma 29. fJBerOO l Lemma 1]) Assume that WishfulThinking has terminated and 

that A* is an independent set. Then — ^ .) < —. 

F w(A) ~ 2 
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Proof. Let G = (V, E) be the graph and let A be the independent set that has been found 
using WishfulThinking. Let A* be any independent set in G (in particular it could be 
the maximum independent set). We will distribute w(A*) among the vertices of A such 
that no vertex v € A receives more than %w(v). This immediately implies the claimed 
result. The distribution consists of two phases. 

In the first phase, every vertex u € A* sends to each of its A-neighbours v € N(u, A) a 
portion of its weight equal to \w(y). Note that N(u,A ) 7 ^ 0 because otherwise {u} would 
be a nice 1-claw and these do not exist when WishfulThinking has terminated. 

In this first phase, every vertex u sends a portion of its weight equal to ^w(u,A). By 
the definition of the charge function, the portion of its weight that is not distributed yet 
equals charge(u,n(u, A)). In the second phase u sends charge(u,n(u, A)) to n(u,A). 

Now consider some vertex v € A in the receiving side of this distribution. In the first 
phase v gets 7 w{y ) from all its neighbours in A*. Because A* is an independent set and 
the graph is fc-claw free, v has at most k — 1 neighbours in A*. Thus v gets at most 
(k — l)|u;(u) = ^w(v) — \w{y) in the first phase. 

In the second phase, v receives exactly Y1 U £N(v ^ 4 *) charge(u,v). By the definition of 
a good claw, this can be at most | w(v ): otherwise the vertices in A* sending positive 
charges to v form the talons of a good claw with v £ A at its center. 

Hence every vertex v € A receives at most ^w(v) — \w(v) + \w{v) = | w(v ), and hence 
the weight of A* is at most § times as much as the weight of A. □ 

As noted, this yields a ^-approximation for the weighted independent set problem in 
Ai-claw free graphs, which results in a approximation for weighted fc-set packing. 

Instructive example Berman |BerOO| proceeds with an example of an instance where 
an iteration of WishfulThinking in fact decreases w(A), which is the function it is in fact 
trying to maximise. This is contra-intuitive: we are using a local search technique and from 
a given solution we might move to a next solution with a worse objective value. However, 
as we will prove in Section Tb.31 an iteration of WishfulThinking always increases w 2 (A ) 
and that suffices for the analysis. 

Here is the example, see Figure l 6 Al Let S' be a subset of the current independent set A 
depicted at the bottom of Figure l6Al and let T be a subset of the vertices in V\ A depicted 
at the top. Write S = {si,... , 55 } with ie(sj) = 10 for i = 1,..., 5, and T = {^i, ^ 2 } with 
w(t\) = w(t 2 ) = 18. Let n(ti,A ) = S 3 for i = 1,2. 

We claim that {S 3 } U (i 1 , ^ 2 } is a nice claw. To see this, note that charge(ti, S 3 ) = 
w(ti)-\w(ti,A) = 18-|(10-|-10-|-10) = 3. So for S 3 we have charge(ti, S 3 ) = 3+3 = 6 , 
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18 18 



Figure 6.1: An example where an iteration of WishfulThinking actually decreases w(A). 
The bottom vertices are in S C A and the top vertices are in T C V \ A. 


which is larger than ^w(s 3) = 5. So all conditions are satisfied and {S3} U {ti, ^2 } is a 
good claw, and because it is minimal it is also a nice claw. 


However, adding T and removing N(T, S) = S means adding two sets of weight 18 
and removing 5 sets of weight 10. Hence w(A) decreases by 14. But the squared weight 
function increases: the gain is twice 18 2 and the loss is five times 10 2 , so it increases by 
648 — 500 = 148. In fact, elementary calculus shows that w c increases in this example 


for c > log5 - log2 
iul c ^ log 18 -log 10 


1.56, or in a more general setting, for c > 


log|S|—log IT 


logw(si)-logw(ti) ■ 

[BK03| for results on using the misdirected weight function w c for some c / 1. 


See also 


6.2.3 A new observation 


For the independent set problem in fc-claw free graphs the use of w 2 rather than w can be 
advantageous due to the following observation. Let A be some independent set and let u 
be some vertex not in A. 


E 

teN(u,A) 


E N*) 

teN(u,A) v 


max 

t&N(u,A) 



( max 
\teN(u,A) 


w ( t )) E 

' teN(u,A) 


wit'). 


This proofs the following observation. 


Observation 30. w 2 {u,A) < n(u, A)w(u, A). 


So the squared weight function of a set of vertices is capable of capturing information 
not only about the sum of the weights but also about the maximum weight. Using this 
simple observation Berman’s proof can be simplified in the next section. 
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6.3 Simplified proof 

Here is a simplified proof of the main lemma from [BerOOj proving that adding a nice claw 
to the current independent set A improves w 2 (A). We believe this proof gives some more 
insight into what is really happening behind the math thanks to Observation I30L 

Lemma 31. If C is a nice claw, then Tc improves w 2 {A). 

Proof. Let A be the current independent set and let C = Zq U Tq = {u} U T be a nice 
claw. We need to show that the weight of what is added (T) is more than the weight of 
what is lost (N(T, A)), so we need to show that 

w 2 (T) > w 2 (T, A), (6.1) 


or equivalently, 

w 2 (T ) — w 2 (T, A — {u}) > w 2 (v). 

To proof that for a nice claw (16.21) holds, we proof the following claim. 
Claim 1. Let C = {u} U T be a nice claw. Then 


( 6 . 2 ) 


w 2 (T , A — {u}) < ^2 (w 2 (u, A) — w 2 (v)) . 

u&T 

Proof. By definition, w 2 (T,A — {u}) is the sum of the squared weights of all neighbours 
of the talons T in A excluding v. Summing over T rather than the neighbourhood of 
T, this can be bounded by w 2 (u,A — {u}). This is an upper bound, because in this 

u£T 

expression vertices that are neighbours of more than one vertex in T are counted multiple 
times. Therefore, w 2 (T, A — {v}) < w 2 (u, A — {u}). Noting that w 2 (u, A — {v}) equals 

u£T 

w 2 (u,A ) — w 2 (v), the claim follows. □ 

For the first term in (16.21) . write w 2 {T ) = YhieT u,2 ( u )- N° w by Claim [TJ the following 
implies (J6.2D . 

(u; 2 (u) — w 2 (u, A) + w 2 (v)) > w 2 {y). (6.3) 

To show that (16.31) holds when C is a nice claw, we proceed to the second claim. 

Claim 2. Let C = {v} Li T be a nice claw. Then v = n(u , A) for all u in T and 


w(v) < 2 charge(u , v ). 

u£T 


(6.4) 





6.3. SIMPLIFIED PROOF 


45 


Proof. Equation (16.41) just follows from the definition stating YlueT charge(u, v) > ^w(v). 
Also, as C is a nice claw, it is minimal, implying that every term on the right-hand side of 
()6.4I) is positive. By the definition of charge , this is true only if v is the maximum weight 
neighbour of u within A. □ 

So proving that (|6.3I) holds when C is a nice claw has now been reduced by Claim [ 2 ] 
to showing that whenever charge(u , v) > 0 we have 

w 2 (u ) — w 2 (u, A) + w 2 (v) > 2w(v)charge(u , v). 

Here we plugged in (16.4)1 only once in the right-hand side. By the definition of charge , 
this boils down to proving that 

w 2 {u) — w 2 (u, A) + w 2 (v) > 2 w(u)w(v) — w{v)w{u , A) (6.5) 

holds whenever v is the maximum weight neighbour of u. 

From this point on we will deviate from the proof of Berman [BerOO] . He now scales 
the quantities, makes a case distinction and rewrites the equations algebraically until it 
is clear they are indeed true. However, (j6.5jl can be shown more easily using Observation 
[30l Plugging this into (16.51) yields 

w 2 {u ) — w(v)w(u , A) + w 2 (y ) > 2 w(u)w(v) — w(v)w(u, A). 

The terms w(v)w(u, A) now cancel and what is left is 

w 2 {u) + w 2 (y) > 2 w(u)w(v), ( 6 . 6 ) 

which is obviously true as this is equivalent to 

(w(u) — w(v)) 2 > 0. 


We have now proved that when C is a nice claw, m holds, and thus Tq improves 

w 2 {A). □ 
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Discussion 


In this chapter we will discuss possible directions for further research. The LP and the 
SDP relaxations for the problem are discussed in Section [77T1 Section [T72l considers possible 
ways in which the weighted approximation algorithm of Berman [BerOOj could be lightly 
changed. Section 17.31 continues with a discussion about the difference in the results on 
the independent set problem in bounded degree graphs and the current results on k -set 
packing. Finally Section 17.41 discusses the problems arising when one tries to generalise 
the unweighted approximation algorithms with weights. Sections 17.1117.31 and 17.41 suggest 
future research directions and contain some conjectures. 

7.1 LP and SDP relaxations 

Here is a summary of the current results. 

1. The standard LP relaxation for k -set packing has integrality gap k— 1+p In the case 
of ^-dimensional matching (i.e. when the hypergraph is A;-partite) the integrality gap 
equals k — 1. Chan and Lau [ CLIO] gave algorithms for these cases. These results 
and algorithms also extend to the weighted case. 

2. The intersecting family LP for k-set packing has integrality gap at most | + 1 + e 
(Theorem [l7|). It is not known whether this result also extends to the weighted case. 

3. By the results of [CLIO] , there also exists a polynomially sized LP for fc-set packing 
with integrality gap at most | + 1 + e. We don’t know whether this is also true for 
the weighted version either. 

4. Also by [CLIP] , the Lovasz Theta function is at least as strong as the intersecting 
family and therefore this SDP relaxation has integrality gap at most | + 1 + e. 
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7.1.1 Extending results to the weighted case 

Unweighted versus weighted While the results for the standard LP extend to the 
weighted case, the results for the intersecting family LP do not extend in an obvious 
way. This is for the same reason why the unweighted approximation algorithms for fc-set 
packing do not easily generalise to the weighted case (c.f. Section E3D: the local search 
technique is relying crucially on cardinality. For example, T\ (e) in the proof of Theorem 
m need not be an intersecting family in the weighted case: the objective function might 
increase even by adding less sets than are removed. We elaborate on this in Section [7.41 

Possible research directions Perhaps there is another way to partition the hyperedges 
rather than in J -\, JL and F^ + that does provide a way to extend the result to the weighted 
case. For the weighted problem the setting of Ui, Ti and F% + does not make sense and 
nothing can be proved. In the weighted case it seems to make sense to define a f-locally 
optimal solution as a solution where adding at most t new sets and losing any number of 
sets does not yield an improvement (instead of losing less than t sets in the unweighted 
case). But perhaps in the weighted case it will prove to be worthwhile to consider the 
integral optimum solution rather than a t-locally optimum. 

The proof of the existence of a polynomially sized LP in Theorem [T9] depends on the 
existence of small kernels for every intersecting family. If a new result on the integrality 
gap of some LP for weighted fc-set packing relies on intersecting families, this result still 
holds. The SDP relaxation still captures all intersecting family constraints in the weighted 
case (one can just add weights to the objective function and nonnegativity constraints on 
them), so if one proves a result for weighted k- set packing using these intersecting families, 
that result immediately extends to the polynomially sized LP and SDP relaxation. 

Problem 1. Narrow the gap between the integrality gap for relaxations of the weighted 
and the unweighted k-set packing problem. 

7.1.2 Smaller bound on integrality gap 

Another way to improve upon the current results is to further decrease the upper bound 
on the integrality gap on LP relaxations like the intersecting family. This seems likely to 
be possible, because all unweighted approximation algorithms achieve bounds of pp + e 
or pp + e while the current result is | + 1 + e. Perhaps some ideas from these algorithms 
can be extended to the integrality gap of the LP relaxation. 
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Problem 2. Improve the integrality gap of relaxations for the unweighted k-set packing 
to pp + e or better. 

Towards a gap of pp + e In particular we would like to point out that it may be 
worthwhile to see if the idea from the quasi-polynomial time pp + e from |CGM13j can 
be extended. The proof of this approximation guarantee also depends on Lemma [TJ] as 
did our new bound of | + 1 + e. Their algorithm uses slightly more crafted ideas, but 
these do not seem to extend to the integrality gap of the LP in a straightforward way. 
Roughly speaking they are able to bound T\ by s\M\ rather than |M|. This is then added 
twice like in the proof of Theorem [171 This effectively decreases the bound by ||M| which 
is exactly the current difference between the results. Perhaps by altering the argument a 
little bit, one could use the idea of this proof to improve the bound for the integrality gap 
for the intersecting family LP to, say, pp + e. 

Towards a gap of ^p + e Also the idea from the ^^-approximation by Sviridenko 
and Ward |SW13| might be interesting to take a closer look at. Very roughly speaking 
they bound Ti by 2\M\J-\ |. This cancels against the bound for T\ and hence they obtain 
3|.F| < k\M\ + 2\M\. A similar idea could perhaps be used to bound the LP-values of the 
sets T\ and JL. 

Since the local search techniques have reached their limits in terms of their approx¬ 
imation guarantee, other techniques have to be sought to improve the approximation 
guarantee. The results on the LP and SDP relaxations for k- set packing are scarce and 
we believe more research in this area could turn out to be fruitful. We make the following 
conjecture. 

Conjecture 3. The integrality gap of relaxations for the unweighted k-set packing can be 
bounded by pp + e. 

7.2 Improving Berman’s weighted approximation 

7.2.1 Generalising charge and claws 

The analysis as presented in Section 16.31 shows that slight changes to the algorithm do not 
simply constitute an improvement of the approximation guarantee to h for some c > 2. 
Let us replace the \ i n the definition of charge(u , v ) to ^ and the -y in the definition of a 
nice (good) claw to p 
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Approximation guarantee Lemmal29lcan now be easily adapted to hold for any c > 2. 
The proof remains exactly the same and WishfulThinking now has an approximation 
guarantee of 

Nice claw improves w 2 (A) However, Lemma [311 is no longer true and this can be seen 
from our simplified proof. In the proof that the talons of a nice claw improve w 2 (A ) when 
the definitions of charge(u,v) and of a nice claw have been altered, the analysis remains 
exactly the same except for the fact that in Equation (16.61) . the factor 2 in the right-hand 
side changes into a c: 


w 2 (u) + w 2 {y) > cw(u)w(v), (7.1) 

However, Equation (17.111 is only true for all u and v for the values c = 0 and c = 2. Since 
c = 0 is not possible because c occurs in the denominator in the definition of charge(u, v) 
and of a nice claw, w 2 (A) improves only when c = 2 when following the current analysis. 

It is possible to incorporate an extra constraint in the definition of a nice claw to 
make sure that Equation m holds for some c > 2. Then Lemma EH is true because of 
the altered definition. However, the extra constraint now causes trouble in the proof of 
Lemma 1291 which is then not necessarily true anymore. 

7.2.2 Generalising the weight function 

Intuitively, what is the crucial point why w 2 might behave differently than w7 In general, 
looking at the squared weight function (or at w c for any c > 1 ) is slightly more biased 
towards larger weights. When a vertex has some weight m and we add 1 to its weight, w 
increases by 1 while w 2 increases by 2m + 1. Also the square of the weight of one vertex 
might be more than the weight of two vertices, w 2 prefers one vertex of weight 3 to two 
vertices of weight 2 , while w prefers the two vertices of weight 2 to the single vertex of 
weight 3. So by guiding the search by w 2 rather than w, an iteration might decrease the 
real objective function but it is more difficult to get stuck in an inferior locally optimal 
solution and hence a better result might be achieved in the end. 

One could also try to improve w p for some 2 < p E N next to the use of the parameter c 
rather than 2. Following the same analysis as in Section 171701 Equation (16.61) then translates 
to 

w p (u) + w p (v) > cw(u)w p ~ l (v). 

This is trivially true for c = 0 and even p, for p = c = 1 and for p = c = 2. Again the 
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first is not a real option, the second option gives an approximation guarantee of j which 
is not an improvement and the third option is the one that leads to Berman’s result. This 
shows that using the current analysis this is the best possible result, so to improve the 
approximation guarantee one really needs another analysis or another approach. See also 
[BK03| for other reasoning about generalising Berman’s algorithm. 

7.3 Relation to the independent set problem 

Here is a summary of the relation between fc-set packing and the independent set problem. 

1. Finding a maximum set packing is equivalent to finding a maximum independent set 
in the conflict graph of the set packing instance. 

2. Finding a maximum k -set packing can be reduced to finding a maximum independent 
set in the k + 1-claw free conflict graph of the k -set packing instance. 

7.3.1 Results on the independent set problem 

General graphs The following results are known for the maximum independent set 
problem in general. Let n be the number of vertices in the graph. 

1. The problem is NP-hard and it is hard to approximate within n 1-£ unless NP-hard 
problems have randomised polynomial time algorithms |Has96j . 

2. There is an approximation algorithm that achieves an approximation guarantee of 

®(i#d ™' 

Bounded degree graphs When the maximum degree of every vertex is assumed to 
be bounded by some A, approximating the problem becomes considerably easier. The 
following results are known for the maximum independent set problem on bounded degree 
graphs. 

1. Assuming the Unique Games Conjecture |Kho02| it is hard to approximate within 

o(e?a) EES®. 

2. The greedy algorithm is an obvious A-approximation. Hochbaum [Hoc83l was the 
first to give an approximation algorithm with approximation guarantee which 
was improved to |HR94al IHL97j . Berman and Fiirer [BF94] give a + 
^-approximation for even A and a ,25 + e-approximation for odd A, which 


























52 


CHAPTER 7. DISCUSSION 


was slightly improved in [BF95j . Then a major jump came with an O 
approximation, a ^(1 + o(l))-approximation, a proof that greedy achieves and 
that Berman and Fiirer really achieved IHR94bj . The currently best result is an 
O ^ A A ) -approximation in polynomial time that also extends to the weighted 
case |Hal()2l lHa!99i !Vis96j . 

7.3.2 From bounded degree to claw-free graphs 

Comparison The results on the maximum independent set problem in bounded degree 
graphs are much stronger than the results on fc-set packing. For example, the gap between 
its hardness (O ( i,,^ a )) an< ^ ^s a PP rox i ma tion guarantee (O (AlogsA)) is mlKh 
smaller versus + e). The constraint that the degree is bounded apparently 

allows much more arguments than the constraint that the size of an independent set in 
the neighbourhood of every vertex is bounded. 

Problem 4. Narrow the gap between the approximation guarantee of k-set packing and 
the independent set problem on bounded degree graphs. 

Mimicking the bounded degree algorithm As an example, consider the O ^ A 

approximation for weighted independent set in bounded degree graphs by Halperin [Hal02l 
Section 5]. In a nutshell this solves a semidehnite programming relaxation and partitions 
the resulting vectors in sets So, S\ and S- 2 - It then uses the greedy approach on So to find 
an independent set Iq, projects and normalises the vectors in S± and selects some of them 
to find I 1, and just sets I2 = S'2. It then returns the largest weight independent set from 
Jo, I\ and I2 . By a good choice of a parameter the approximation guarantee is achieved. 

If one tries to use the same approach on the independent set problem in k + 1-claw free 
graphs it is the set So that is causing trouble. In the bounded degree case it is trivially true 
that the greedy algorithm produces an independent set Jo of total weight at least 
where w(So) is the sum of the weights of the vertices in So- Intuitively So is already a 
good structured set so it is not needed to do anything smarter than the greedy algorithm. 

However, in the k+ 1-claw free case the greedy algorithm does not have such a sufficient 
performance guarantee. It has a performance guarantee of k, so the value of the greedy 
solution can be compared to the optimal independent set size in So- But it can not be 
compared to the weight (or the cardinality) of So- 

Intuition and research direction Morally it seems there should not be such a differ¬ 
ence between the independent set problem in bounded degree graphs and claw-free graphs. 
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Look at a vertex v in a claw-free graph and at its neighbours N(v). As the maximum 
size of an independent set in N(v) is at most k, one could look at N(v) as the union of k 
cliques (in relation to the set packing instance: one clique for the sets that all share one of 
the k elements, modulo some duplicates). And for the independent set problem, a clique 
is not that different from a vertex: it is only possible to pick one of the vertices. It would 
be an interesting research direction to see where exactly the analogy with the bounded 
degree graphs stops, or to somehow change the algorithm for the bounded degree graphs 
and achieve an improved approximation guarantee for claw-free graphs. 

The intuition in this paragraph inclines us to believe the approximation guarantee for 
k- set packing can be brought down further. Perhaps it it not which is the best known 
hardness bound, but we make the following conjecture. 

Conjecture 5. The approximation guarantee for k-set packing can be bounded by some 
function strictly smaller than 

7.4 Unweighted versus weighted k -set packing 

General extension For most problems the weighted version is not much more difficult 
than the unweighted version. For example, also weighted matchings can be found in graphs 
in polynomial time, and for a lot of problems adding weights to the LP-formulation does 
not change anything significantly. However, for k- set packing the difference between the 
weighted version and the unweighted version is nontrivial. 

Extending k -set packing As noted in Section 13. 1 1 the analysis of the unweighted case 
hugely depends on the cardinality of every set. This is because the algorithms rely on 
local search and analyse a locally optimal solution. However, adding more sets than you 
remove from your current solution may not be advantageous in the weighted case, as the 
total weight might decrease while the cardinality of the solution increases. And in the 
weighted case, it could be the case that the total weight increases when you add less sets 
than you remove from your current solution. This proves hard to be incorporated in the 
analysis, and the weights of the sets cannot be handled in a straightforward way. This is 
why the results for the unweighted case do not easily extend to the weighted case. There 
have been less results on the weighted case and the algorithms do not immediately follow 
from the unweighted results, although they all use local search techniques. 

Problem 6. Improve the approximation guarantee for weighted k-set packing. 
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Research direction The best weighted approximation algorithms reduce the problem 
to the independent set problem on k + 1-claw free graphs. There are no indications so far 
that better results can be obtained in the weighted fc-set packing problem by holding back 
from this reduction, so this seems a good way to look at the problem. These weighted 
approximation algorithms achieve approximation guarantees of approximately 4^- [CH99| 
and | (E erOOj . Section ID21 considered possible ways to improve upon this last algorithm 
by changing it a little bit, aided by the simplified proof from Chapter [6l Berman and 
Krysta jBKO.'l'j considered what values of w a are the best for every k when one searches a 
2-locally optimal solution and achieve an approximation guarantee of about A natural 
extension of this would be to use this alternate weight function in a search for f-locally 
optimal solutions for some t > 2. Considering such larger improving sets proved to work 
for the unweighted problem and it trivially works for the weighted case when one searches 
for improving sets of size n; the question is, how large do the improving sets have to be to 
get an improved approximation guarantee of, say, about |? Perhaps a value of t = 0{k) 
or t = 0(logn ) works. We think it is fruitful to investigate this possibility and either find 
a better approximation algorithm or an indication that this might not be helpful after all. 

We believe a better approximation guarantee could be obtained by increasing the 
search space and we make the following conjecture. 

Conjecture 7. The approximation guarantee for weighted k-set packing can he bounded 
by + e for some fixed c > 0. 
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Appendix A 

Parameterized complexity 


Complexity 

Reference 



Remarks 

0*(( 5.7m) m ) 

Jia, /hang and Chen JZC04] 

R 

3 


0*( 10.88 3m ) 

Koutis |Kou05| 

R 

k 


0*(16 m ) 

Chen et al [CLSZQTj 

R 

k 

Also for weighted problem 

0*( 2 3m ) 

Koutis |Kou08j 

R 

k 


0*( 1.493 3m ) 

Bjorklund et al [BHKK10 

R 

k 


0*(2°M(3m)!) 

Downey and Fellows |DF99j 

D 

3 


0*(25.6 3m ) 

Koutis Kou05l 

D 

k 

Originally 0*{ 2 °( m O) > 0*(32000 3m ) 




([CLSZ07, LLCS06 ) 

0*(13.78 3m ) 

Fellows et al FHR + 04 

D 

k 

Originally exp(0(mk )) = 0*(12.7D) 3m , 




D > 10.4 QCLSZ07, LLCS06]) 

0 *(12.8 3m ) 

Chen [ChelS] 

D 

k 


0 *(12.8 3m ) 

Liu, Chen and Wang [LCW07] 

D 

k 

Also for weighted problem 

0 *(12.8 3m ) 

Chen et al [CLSZ07| 

D 

k 

Also for weighted problem 

0*(7.56 3ra ) 

Wang and Feng [WF08a] 

D 

3 

Also for weighted problem 

0*(5.44 3m ) 

Chen and Chen ICC13| 

D 

k 


0*( 4.61 3m ) 

Liu et al [LLCS06 

D 

3 


0*( 4 3m ) 

Chen et al CKL+09 

D 

k 

Also for weighted problem 

0*(3.523 3m ) 

Wang and Feng [WF08b] 

D 

3 


0*( 32 m ) 

Feng et al fFLLW09] 

D 

k 

Also for weighted problem 


Table A.l: Parameterized complexity results of randomised (R) and deterministic (D) 
algorithms for 3 — SP. Column 4 indicates whether the result only applies to 3 — SP (3) 
or whether it is derived from a more general result from k — SP (k). 
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APPENDIX A. PARAMETERIZED COMPLEXITY 


Complexity 

Reference 


Remarks 

O*(10.88 mfc ) 

Koutis |Kou05| 

R 



Chen et al |CKL + 09 

R 

Also for weighted problem 

0*( 2 mk ) 

Koutis |Kou08j 

R 

f{m,k) « (-o.68 -) 

0*(f(m, k)) 

Biorklund et al [BHKKIOj 

R 

0*(g(k,m)) 

Jia, Zhang and Chen [JZC04] 

D 


0*( 25.6 mk ) 

Koutis [Kou05j 

D 

Originally 0*(2 °( m D) ([CLSZ07, [LLCS06]) 

0*( 13.78 mfc ) 

Fellows et al FKN+08 

D 

Originally exp(O(mk)) ([CLSZ07, LLCS06]) 

0 *(12.8 mfc ) 

Chen [Chel3] 

D 


0 *(12.8 mfc ) 

Liu. Chen and Wang [LCW07j 

D 

Also for weighted problem 

0*( 5.44 mk ) 

Chen and Chen [CC13| 

D 


0*Umkj 

Chen et al CKL + 09 

D 

Also for weighted problem 

0* ^2(2Ai— l)m^ 

Feng et al [FLLW09] 

D 

Also for weighted problem 


Table A.2: Parameterized complexity results of randomised (R) and deterministic (D) 
algorithms for k — SP. 
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